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FROM INFINITE URN SCHEMES TO DECOMPOSITIONS OF 
SELF-SIMILAR GAUSSIAN PROCESSES 

OLIVIER DURIEU AND YIZAO WANG 


Abstract. We investigate a special case of infinite urn schemes first considered by Karlin 
141 ] . especially its occupancy and odd-occupancy processes. We first propose a natural ran¬ 


domization of these two processes and their decompositions. We then establish functional 
central limit theorems, showing that each randomized process and its components con¬ 
verge jointly to a decomposition of certain self-similar Gaussian process. In particular, the 
randomized occupancy process and its components converge jointly to the decomposition 
of a time-changed Brownian motion B(t Q ), a € (0,1), and the randomized odd-occupancy 
process and its components converge jointly to a decomposition of fractional Brownian 
motion with Hurst index H £ (0,1/2). The decomposition in the latter case is a special 
case of the decompositions of bi-fractional Brownian motions recently investigated by Lei 
and Nualart H3- The randomized odd-occupancy process can also be viewed as correlated 
random walks, and in particular as a complement to the model recently introduced by 
Hammond and Sheffield [l2| as discrete analogues of fractional Brownian motions. 


1. Introduction 


We consider the classical infinite urn schemes, sometimes referred to as the balls-in-boxes 
scheme. Namely, given a fixed infinite number of boxes, each time a label of the box is 
independently sampled according to certain probability /i, and a ball is thrown into the 
corresponding box. This model has a very long history, dating back to at least Bahadur 
[lj. For a recent survey from the probabilistic point of view, see Gnedin et al. 11]. In 
particular, the sampling of the boxes forms naturally an exchangeable random partition 
of N. Exchangeable random partitions have been extensively studied in the literature, 
and have connections to various areas in probability theory and related fields. See the 
nice monograph by Pitman 22] for random partitions and more generally combinatorial 
stochastic processes. For various applications of the infinite urn schemes in biology, ecology, 
computational linguistics, among others, see for example Bunge and Fitzpatrick |f|. 

In this paper, we are interested in a specific infinite urn scheme. More precisely, we 
consider /i as a probability measure on N := {1,2,...} which is regularly varying with 
index 1/a, a e (0,1). See the definition in Section 12/II This model was first considered by 
Karlin 14] and we will refer to it by the Karlin model in the rest of the paper. 
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We start by recalling the main results of Karlin 
dent sampling from p for each round i > 1, and 


14). Let represents the indepen- 


n 

dn,fc := 'y ^ 1 {Yi=k} 
i =1 


be the total counts of sampling of the label k in the first n rounds, or equivalently how 
many balls thrown into the box k in the first n rounds. In particular, Karlin investigated 
the asymptotics of two statistics: the total number of boxes that have been chosen in the 
first n rounds, denoted by 

z *( n ) ■= 

fc>i 

and the total number of boxes that have been chosen by an odd number of times in the 
first n rounds, denoted by 


U*(n) : = 


k> 1 


1 


{Vn.fc is odd}- 


The processes Z* and U* are referred to as the occupancy process and the odd-occupancy 
process , respectively. While Z* is a natural statistics to consider in view of sampling 
different species, the investi gati on of U* is motivated via the following light-bulb-switching 
point of view from Spitzer [26). Each box k may represent the status (on/off) of a light 


bulb, and each time when k is sampled, the status of the corresponding light bulb is 
switched either from on to off or from off to on. Assume that all the light bulbs are off at 
the beginning. In this way, U*(n ) represents the total number of light bulbs that are on at 
time n. 

Central limit theorems have been established for both processes in [m), in form of 


( 1 ) 


n 


EZ* 


n 


(7 ri 


Af(0, 


ON 


and 


U*{n) — EU*[n) 


(7 r 


W(0, 


cr. 


U ) 


for some normalization <j n , with <r| and a/j explicitly given as the variances of the limiting 
normal distributions, and where denotes convergence in distribution. We remark that 
a 2 is of the order 77 “, up to a slowly varying function at infinity. 

The next seemingly obvious task is to establish the functional central limit theorems for 
the two statistics. However, to the best of our knowledge, this has not been addressed 
in the literature. Here by functional central limit theorems, or weak convergence, we are 
thinking of results in the form of (in terms of Z*) 


( 2 ) 


Z*([nt\) -EZ*([nt\) 


(7 n 


£G[0,1] 


{Z *(t)}t G [0,!], 


in space -D([0,1]) for some normalization sequence a n and a Gaussian process Z*. In view 
of ([T]) and the fact that a 2 has the same order as n a , the scaling limit Z*, if exists, is 
necessarily self-similar with index a/2. 

In this paper, instead of addressing this question directly, we consider a more general 
framework by introducing the randomization to the Karlin model (see Section 12.11 for the 
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exact definitions). The randomization of the Karlin model reveals certain rich structure of 
the model. In particular, it has a natural decomposition. Take the randomized occupancy 
process Z e for example. We will write 

Z e {n ) = Z[{n) + Z e 2 (n) 

and prove a joint weak convergence result in form of 

-(Zf(Li!iJ),z{(Ln(J),z'([n(J)) K [0,i] =*• ■ 

On 

in D([0, l]) 3 , such that 

Z = TL\ + Z 2 with Zi and Z 2 independent. 


In other words, the limit trivariate Gaussian process (Z^f), Z 2 (f), Z(f)) te [ 0i i] can be con¬ 
structed by first considering two independent Gaussian processes Zi and Z 2 with covariance 
to be specified, and then set Z(t) := Zi(t)+Z 2 (t), t e [0,1]; in this way its finite-dimensional 
distributions are also determined. We refer such results as weak convergence to the decom¬ 
position of a Gaussian process. Similar results for the randomized odd-occupancy process 
are also obtained. Here is a brief summary of the main results of the paper. 


• As expected, various self-similar Gaussian processes appear in the limit. In this 
way, the randomized Karlin model and its components, including Z* and U* as 
special quenched cases, provide discrete counterparts of several self-similar Gaussian 
processes. These processes include notably the fractional Brownian motion with 
Hurst index H = a/2, the bi-fractional Brownian motion with parameter H = 
1/2, K = a, and a new self-similar process TL\. 

• Moreover, in view of the weak convergence to the decomposition, the randomized 
Karlin model are discrete counterparts of certain decompositions of self-similar 
Gaussian processes. The randomized occupancy process and its two components 
converge weakly to a new decomposition of the time-changed Brownian motion 
(B(i"))t> 0 , a 6 (0,1) (Theorem [T]). The randomized odd-occupancy process and 
its two components converge weakly to a decomposition of the fractional Brownian 
motion with Hurst index H = a/2 e (0,1/2) (Theorem [2D- This decomposition is 
a particular case of the decompositions of bi-fractional Brownian motion recently 
discovered by Lei and Nualart (l7j |. 


Self-similar processes have been extensively studied in probability theory and related 
fields [9], often related to the notion of long-range dependence (25l . 2ll |. Among the self¬ 
similar processes arising in the limit in this paper, the most widely studied one is the 
fractional Brownian motion. Fractional Brownian motions, as generalizations of Brownian 
motions, have been widely studied and used in various areas of probability theory and ap¬ 
plications. These processes are the only centered Gaussian processes that are self-similar 
with stationary increments. The investigation of fractional Brownian motions dates back 


to Kolmogorov [16] and Mandelbrot and Van Ness [18]. As for limit theorems, there are 


already several models that converge to fractional Brownian motions in the literature. 
See Davydov [ 7 ], Taqqu jl3], Enriquez jjioj, Kliippelberg and Kuhn Kjj], Peligrad and 
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Sethuraman [20|, Mikosch and Samorodnitsky |19j, Hammond and Sheffield 12] for a few 
representative examples. A more detailed and extensive survey of various models can be 
found in Pipiras and Taqqu (2ll ] . Besides, we also obtain limit theorems for bi-fractional 
Brownian motions introduced by Houdre and Villa [Hj. They often show up in decompo¬ 
sitions of self-similar Gaussian processes; see for example [24J, |l7j . However, we do not find 
other discrete models in the literature. As for limit theorems illustrating decompositions of 
Gaussian processes as ours do, in the literature we found very few examples; see Remark [H 

Our results connect the Karlin model, a discrete-time stochastic process, to several 
continuous-time self-similar Gaussian processes and their decompositions. By introduc¬ 
ing new discrete counterparts, we hope to improve our understanding of these Gaussian 
processes. In particular, the proposed randomized Karlin model can also be viewed as cor¬ 
related random walks, in a sense complementing the recent model introduced by Hammond 
and Sheffield jl2[ that scales to fractional Brownian motions with Hurst index H G (1/2,1). 
Here, the randomized odd-occupancy process ( U £ below) is defined in a similar manner, 
and scales to fractional Brownian motions with H G (0,1/2). 

The paper is organized as follows. Section [2] introduces the model in details and present 
the main results. Section [3] introduces and investigates the Poissonized models. The de- 
Poissonization is established in Section [4] 


2. Randomization of Karlin model and main results 

2.1. Karlin model and its randomization. We have introduced the original Karlin 

model in Section [T] Here, we specify the regular variation assumption. Recall the definition 
of (pk)k> i- We assume that pk is non-increasing, and define the infinite counting measure 
v on [0, 00 ) by 

z ' Pi 

3> 1 

for any Borel set A of [0, 00 ), where 8 X is the Dirac mass at x. For all t > 1, set 

(3) iAt) \= u([0 ,£]) = max {.7 > 1 | Pj > 1/t}, 

where max0 = 0. Following Karlin [l4[, the main assumption is that v(t) is a regularly 
varying function at 00 with index a in (0,1), that is for all x > 0, lim^oo u(tx)/u(t) = x a , 
or equivalently 

(4) v{t) = t a L(t), t > 0, 

where L{t) is a slowly varying function as t —> 00 , i.e. for all x > 0, lim^oo L(tx)/L(t) = 1. 
For the sake of simplicity, one can think of 

Pk ~ Ck~v for some a G (0,1) and a normalizing constant C > 0. 

k —^00 

This implies u(t) ~ C a t a . 

t—> OO 
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We have introduced two random processes considered in Karlin |14| : the occupancy 
process and the odd-occupancy process as 




{Yn,k* 0 } 


and U*{n) := ^ 1 


{Y n ,k is odd} 


k> 1 


k> 1 


respectively. To introduce the randomization, let £ := (e k ) k >i be a sequence of i.i.d. 
Rademacher random variables (i.e. P(£fc = 1 ) = P(£& = — 1 ) = 1 / 2 ) defined on the same 
probability space as the ( Y n ) n >i and independent from them. In the sequel, we just say 
that £ is a Rademacher sequence in this situation (and thus implicitly, £ will always be 
independent of (F n ) n >i)- 

Let £ be a Rademacher sequence. We introduce the randomized occupancy process and 
the randomized odd-occupancy process by 

Z £ (n) := 0 } and U £ (n) := Y £ k 1 { r„, fc is odd}- 

fc>i fc>i 

We actually will work with decompositions of these two processes given by 


Z £ {n ) = Zl{ri) + Zf(n) and U £ (n) = Ul(n ) + I/f(n), 


where 



(5) 

Z 1H := Y £ k (1RW0} ~Pk(n)) and Z £ (n) := 

Y £ kPk{n), n> 1, 


k> 1 

fc>i 

(6) 

U{(n) := Y £k OkWfc odd} - g*(n)) and U^(n) 

:= Y £ kQk(n), n> 1, 


fc>i fc>i 


with for all k > 1 and n > 1 , 


Pk{n ) := P (Y n , k ^ 0) = 1 - (1 - p k ) n , 
q k (n) := P (Y n>k is odd) = ^(1 - (1 - 2 p k ) n ). 

In the preceding definitions, the exponent £ refers to the randomness given by the 
Rademacher sequence (£fc)fc>i- Nevertheless, in some of the following statements, the se¬ 
quence of (£fc)fc>i can be chosen fixed (deterministic) in {—1,1} N . Then the corresponding 
processes can be considered as “quenched” versions of the randomized process. For this 
purpose, it is natural to introduce the centering with p k (n ) and q k [n ) respectively above. 
Actually, we will establish quenched weak convergence for Z\ and U[ (see Theorem [3] and 
Remark [T]). With a little abuse of language, for both cases we keep £ in the notation 
and add an explanation like l for a Rademacher sequence s' or ‘ for all fixed £ G { — 1,1} N \ 
respectively. 


2.2. Main results. As mentioned in the introduction, we are interested in the scaling 
limits of the previously defined processes. We denote by Z)([0,1]) the Skorohod space of 
cadlag functions on [0,1] with the Skorohod topology (see [2J). Throughout, we write 

o n := n a/2 L(n) 1/2 , 
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where a and L are the same as in the regular variation assumption (J3]). Observe that 
u(n) = L(n ) = a n = 0 for n < 1/pi- Therefore, when writing l/cx n we always assume 
implicitly n > l/p\. We obtain similar results for Z £ and U e . Below are the main results 
of this paper. 

Theorem 1. For a Rademacher sequence e, 

On 

in (D([0, l])) 3 , where Z^Z^Z are centered Gaussian processes, such that 

Z = Zi + z 2 , 

7L\ and Z 2 are independent, and they have covariances 

Cov(Zi(s), Zi(t)) = T(1 — a) ((s + t) a — max(s, t ) a ), 

Cov(Z 2 (s), Z 2 (t)) = T(1 - a) (s Q + t a -(s + t ) a ), 

Cov(Z(s), Z(t)) = T(1 — a) min(s, t) a , s,t> 0. 

Theorem 2. For a Rademacher sequence £, 

in (D([0, l])) 3 , where Ui,U 2 ,U are centered Gaussian processes such that 

U = U!+U 2 , 

Ui and U 2 are independent, and they have covariances 

Cov(Ui(s),Ui(£)) = T(1 — o)2“ -2 ((s + t) a — |f — s| Q ), 

Cov(U 2 (s), U 2 (t)) = T(1 - a)2 a ~ 2 (s a +t a -(s + t ) Q ), 

Cov(U(s), U(£)) = T(1 - a)2 a ~ 2 ( s a + t a -\t- s|“), s,t> 0. 


To achieve these results, we will first prove the convergence of the first (Z[ and Uf) and 
the second (Z | and £/f) components, respectively. For the first components we have the 
following stronger result. 


Theorem 3. For all fixed e £ { — 1,1} N , 

'Zf(KI)' 


(J Ti 


(ZiW) 


te[o,i] 


tG[0,l] 


and 


CTr, J , 


(Ul (t)) 


te[o,i] 


[o,i] 


in .D([0,1]), where Z x and Ui are as in Theorems Q] and[H 


Remark 1. Theorem [3] is a quenched functional central limit theorem. In particular, 
when taking £ = 1 = (1,1 ,...)gN, Theorem [3] recovers and generalizes the central limit 
theorems for Z*{n) and U*(n ) established in Karlin jlij (formally stated in (JTj) ): the (non- 
randomized) occupancy and odd-occupancy processes of the Karlin model scale to the 
continuous-time processes Zi and Ui, respectively. Moreover, as the limits in Theorem |3] 
do not depend on the value of e, this implies the annealed functional central limit theorems 
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(the same statement of Theorem [3] remains true for a Rademacher sequence e), and entails 
essentially the joint convergence to the decomposition. 


Now we take a closer look at the processes appearing in Theorem [T] and Theorem [2] and 
the corresponding decompositions. The decomposition of U is a special case of the general 
decompositions established in Lei and Nualart [13] for bi-fractional Brownian motions. 
Recall that a bi-fractional Brownian motion with parameter H £ (0,1), K £ (0,1] is a 
centered Gaussian process with covariance function 


(7) 


«"•*(*,() = ^(( t™ +s ™) K 


It is noticed in 17 that one can write 


t-s\ 2HK ). 


( 8 ) 


_ ( + 2HK , 2 HK 

2 K V 6 


|t - s\ 2HK ) = R h,k (s, t) + ^ (t 2HK + s 2HK 


(t 2H + s 2H ) K ) , 


where the left-hand side above is a multiple of the covariance function of a fractional 
Brownian motion with Hurst index HK, and the second term on the right-hand side above 
is positive-definite and hence a covariance function. Therefore, (J5]) induces a decomposition 
of a fractional Brownian motion with Hurst index HK into a bi-fractional Brownian motion 
and another self-similar Gaussian process. 

Comparing this to Theorem [2] we notice that our decomposition of U corresponds to 
the special case of (JHJ) with H = 1/2, K = a. Up to a multiplicative constant, U is a 
fractional Brownian motion with Hurst index H = a/2. The process HR is the bi-fractional 
Brownian motion with H = 1/2, K = a, and it is also known as the odd-part of the 
two-sided fractional Brownian motion; see Dzhaparidze and van Zanten [8|. That is 


(Ui(f)) 


U2»r(i - tt ) (i ( 


t> 0 


l a/2 (t) -B“ /2 (-R) 


t> o 


where B"/ 2 is a two-sided fractional Brownian motion on M with Hurst index a/2 £ (0,1). 
The process HR admits a representation 


U 2 (t) = 2 q 7 2-1 


a i (1 
o 


^St\ — 


a+1 


s ), t > 0, 


where (B(t))i S [o.i] is the standard Brownian motion. It is shown that HRR) has a version 
with infinitely differentiable path for t £ (0, oo) and absolutely continuous path for t £ 
[0, oo). At the same time, HR also appears in the decomposition of sub-fractional Brownian 
motions 0,0- 

For the decomposition of Z in Theorem [I] to the best of our knowledge it is new in 
the literature. Remark that Z is simply a time-changed Brownian motion (Z(t)R>o = 

r(l-ct)(B(r)) t > 0 , and that Z 2 = J 2 a / 2+l \] 2 - The latter is not surprising as the coefficients 
qk(n) and Pk(n) have the same asymptotic behavior. However, we cannot find related 
reference for Zi in the literature. The following remark on Zi has its own interest. 
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Remark 2. The process Z] may be related to bi-fractional Brownian motions as follows. 
One can write 

(s 1/a + f 1/Q ) Q - \s-t\ = 2 

That is, 

(V(t)) t >0 = (2 Mt 1/a ) + Mt 1/a )) t > 0 , 

where Zi and Z 2 are as before and independent, and ¥ is a centered Gaussian process with 
covariance 

Cov(V(s),V(t)) = T{l-a)2 a R 1 ^ a (s,t). 

Therefore, as another consequence of our results, we have shown that for the bi-fractional 
Brownian motions, the covariance function R H,K in (j7j) is well defined for H = 1/(2 a), K = 
a for all a 6 (0,1). The range a G (0,1/2] is new. 


(s 1/a + t i/ a y _ max ( S; ^ + s + t - ( s 1/a + t 1/a ) 


, s, t > 0. 


To prove the convergence of each individual process, we apply the Poissonization tech¬ 
nique. Each of the Poissonized processes Zf, Z|, Uf, U| is an infinite sum of independent 
random variables, of which the covariances are easy to calculate, and thus the finite¬ 
dimensional convergence follows immediately. The hard part for the Poissonized processes 
is to establish the tightness for Z{ and U{. For this purpose we apply a chaining argu¬ 
ment. Once the weak convergence for the Poissonized models are established, we couple 
the Poissonized models with the original ones and bound the difference. The second tech¬ 
nical challenges lie in this de-Poissonization step. Remark that Karlin 14j] also applied the 
Poissonization technique in his proofs. Since he only worked with central limit theorems 
and us the functional central limit theorems, our proofs are more involved. 


Remark 3. One may prove the weak convergence [Z £ {\nt\) / =>■ (Z(t))t e [ 0 ,i] and 

(U £ ([nt})/a n )t£[o,i\ => (TJ(t))t e [o,i] directly, without using the decomposition. We do not 
present the proofs here as they do not provide insights on the decompositions of the limiting 
processes. 


Remark 4. We are not aware of other limit theorems for the decomposition of processes 
in a similar manner as ours, but with two exceptions. One is the symmetrization well 
investigated in the literature of empirical processes 
empirical distribution function 


28]. Take for a simple example the 


1 v—> 

^n(t) '■= — ^-{Xj<t} 

t' 1 
2—1 

where Xi, ... are i.i.d. with uniform (0,1) distribution. By symmetrization one con¬ 
siders an independent Rademacher sequence e and 

^ n i n i n 

:= “ := - (^i^l - 0 and F n 2 W : = - 

i=1 i— 1 i= 1 

It. is straight-forward to establish 

\/^( F nW> F n 1 W ; F n 2 W)te[0,l] => (B(t), B(t) - tB(l), tB(l)) te[0> i]. 
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This provides an interpretation of the definition of Brownian bridge via M brid9e (t) := 

e [0,1]. 

The other example of limit theorems for decompositions is the recent paper by Bojdecki 
and Talarczyk [sj who provided a particle-system point of view for the decomposition of 
fractional Brownian motions. The model considered there is very different from ours, and 
so is the decomposition in the limit. 

2.3. Correlated random walks. We first focus our discussion on U £ . One can interpret 
the process U e as a correlated random walk by writing 


(9) 


U‘(n) = X 1 + --- + X„ 


for some random variables (A*)j>i taking values in {—1,1} with equal probabilities, and 
the dependence among steps is determined by a random partition of N. Viewing Xj as the 
step of a random walk at time i, U £ in (jUJ) then represents a correlated random walk. To 
have such a representation, recall (V;)*>i and consider the random partition of N induced 
by the equivalence relation that i ~ j if and only if V = Yj‘ that is, the integer i and 
j are in the same component of the partition if and only if the i-th and j-th balls fall in 
the same box. Once (V n ) n >i is given and thus all components are determined, one can 
define (X n ) n >i as follows. For each k > 1, suppose all elements in component k (defined as 
{i : Yi = k }) are listed in increasing order rii < n 2 < ■ ■ ■ , and set X ni := £*, and iteratively 
X n := —X 1lp . In this way, it is easy to see that each X n is taking values in {—1,1} with 
equal probabilities, and conditioning on (Y n ) n > i, Xj and Xj are completely dependent if 
i ~ j, and independent otherwise. The verihcation of (jUJ) is straight-forward. 

The above discussion describes how to construct a correlated random walk from random 
partitions in two steps. The first is to sample from a random partition. The second is 
to assign ±1 values to (X n ) n >i conditioned on the random partition sampled. A similar 
interpretation can be applied to another model of correlated random walks introduced in 
Hammond and Sheffield [l2j. The Hammond-Sheffield model also constructed a collection 
of random variables taking values in { — 1,1}, of which the dependence is determined by 
a random partition of Z, in form of a random forest with infinitely many components 
indexed by Z. There are two differences between the Hammond-Sheffield model and the 
randomized odd-occupancy process U £ : first, the underlying random partition is different: 
notably, the random partition in the infinite urn scheme is exchangeable, while this is not 
the case for the random partition introduced in Z in jl2|; rather, the random partition 
there inherits certain long-range dependence property, which essentially determines that 
the Hurst index in the limit must be in (1/2,1). Second, for Xj in the same component 
of random partitions, Hammond-Sheffield model set them to take the same value (all 1 or 
all —1 with equal probabilities), independently on each component. 

The alternative way of assigning values for X, in the same component is the key idea 
in our framework. Clearly this has been considered by Spitzer [|26j and Karlin [3], if not 
earlier. Actually, Hammond and Sheffield [12] suggested, as an open problem, to apply the 
alternative way of assigning values to their model and asked whether the modified model 
scales to fractional Brownian motions with Hurst index in (0,1/2). In our point of view, 
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in order to obtain a discrete model in the similar flavor of the Hammond-Sheffield model 
that scales to a fractional Brownian motion with Hurst index He (0,1/2), the alternative 
way of assigning values is crucial, while the underlying random forest with long memory is 
not that essential. Our results support this point of view. Actually, looking for a model in 
a similar spirit to complement the Hammond-Sheffield model as the discrete counterparts 
of fractional Brownian motions was another motivation for this paper. At the same time, 
the aforementioned suggestion in [|l2| remains a challenging model to analyze. 

As for the occupancy process, similarly one can view Z £ as a correlated random walk. 
The random partition being the same, this time for each component k with elements 
ni < n 2 < ■ ■ ■, we set X ni = e k , X n . = 0, i > 2 to obtain 

Z £ = X 1 + --- + X n . 

The dependence of this random walk is simpler than the odd-occupancy process. 

3. POISSONIZATION 

Recall that we are interested in the processes Z e and U £ , and instead to deal with them 
directly we work with the decompositions Z £ = Z\ + Zf and U £ = U{ + U 2 with the 
components defined in (J5]) and (J6J). 

3.1. Definitions and preliminary results. The first step in the proofs is to consider the 
Poissonized versions of all the preceding processes in order to deal with sums of independent 
variables. Let IV be a Poisson process with intensity 1, independent of the sequence (Y n ) n >i 
and of the Rademaeher sequence £ considered before. We set 

N(t) 

A r k(t) '■= ^ 1 {Y e =k}, t > 0, k > 1. 
t= i 

Then the processes N kl k > 1, are independent Poisson processes with respective intensity 
p k • We now consider the Poissonized processes, for all t > 0, 

Z ( [t ) . ^ ^ and U ( t) . ^ ^ £k\{N^(t) is odd}- 

k >1 k >1 

These Poissonized randomized occupancy and odd-occupancy processes have similar de¬ 
compositions as the original processes 

= Zf + Z 2 e and U £ = \J{ + U\ 


(t) := (l{iv fc (t)^o} ~Pk(t)) , Z £ {t) ■= ^Ekpkit), 

k >1 k >1 

(t) '■= 'y ^ Ek is odd} Qk(t)^) j (t) ’.= ^ ] Ek<jk(t)i 

k >1 k >1 


with 
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and 

Pk(t) ■■= nN k (t) ■+ o) = 1 - e~ Pkt , 

q k {t) \= ¥(N k (t) is odd) = ^(1 - e~ 2pkt ). 

Using the independence and the stationarity of the increments of Poisson processes, we 
derive the following useful identities. For all 0 < s < t and all k > 1, 

(10) 0 < p k (t) - p k (s) = (1 - Pk(s))p k (t - s) < p k (t - s ), 

(11) 0 < q k (t) - q k (s ) = (1 - 2 q k (s))q k (t - s) < q k (t - s). 

Note that, in particular, the functions p k and q k are sub-additive. Further, we will have to 
deal with the asymptotics of the sums over k of the p k or q k . For this purpose, recall that 
(see [3, Theorem 1]) the assumption (j3J) implies 

(12) V(t) :=^(l- e - pfct ) ~r(l-a)t Q L(t), ast^oo. 

k> 1 

We will need further estimates on the asymptotics of V(t) as stated in the following lemma. 


Lemma 1. For all 7 G (0,a), there exists a constant C 7 > 0 such that 

V ( nt ) < Cjf'a 2 , uniformly in t G [0,1], n > 1. 


Proof. Recall the definition of the integer-valued function v in 
we have for all t > 0 , 


By integration by parts, 


V(t) = (1 - e-^ x )du(x) = / x~ 2 e- 1/x u{tx)dx. 

Jo Jo 

Observe that v{t) = 0 if and only if t G [0,1/pi) by definition, and in particular L{t) = 0 
if and only if t G [0,1/pi). Thus, 


V(nt) 


oi 


Now we introduce 


and obtain 


1/(ntpi) 


x 2 e l ^ x v{ntx)dx = t° 


„a- 2 „-l /x L(ntx) 


1/ (ntpi) 


L (n ) 


dx. 


T *( f \ _ j £(1/Pi) if t G [0,1/pi) 

1 Lit) if t G [1/pi, 00 ) ’ 


V ( nt ) 


cr: 


2 — 


< t c 


L*(n) 


Let S > 0 be such that a + <5 < 1 and a — 5 > 7 . Observe that L* has the same asymptotic 
behavior as L by definition. In addition, L* is bounded away from 0 and 00 on any compact 
set of [0, 00 ). Thus, by Potter’s theorem (see [i, Theorem 1.5.6]) there exists a constant 
Cs > 0 such that for all x, y > 0 


O(x) 

L’W 


< Cs max 
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We infer, uniformly in t G [0,1], 


V(nt ) 


at 


< Cst a ( x a 2 e l ! x max (jfx) 5 , (tx) dx 


< Cst' 


t-S 


X 


a-S-2 e -l/x dx + / x a+5 ~ 2 e- 1 /X dx ) , 


and both integrals are finite (the second one because we have taken 6 such that a + 5 < 1). 
Further, t a ~ 5 < t 7 for all t G [0,1] and thus the lemma is proved. □ 


3.2. Functional central limit theorems. We now establish the invariance principles for 
the Poissonized processes. 


Proposition 1. For all fixed e G {—1,1} N , 
(nt) 


(Jrr 


(Zi (<)) 


(€[ 0 , 1 ) 


and 


Uf (nt) 


ie[o,i] 


(Jn 


(Ui(t)) te[0il] ) 


te[o,i] 


in D([0,1]), where is as in Theorem[ 7] and Ui is as in Theorem O 


Proof. In the sequel £ G {—1,1} N is fixed. The proof is divided into three steps. 

(i) The covariances. Using the independence of the N k , and that e\ = 1 for all k > 1, we 
infer that for all 0 < s <t, 


Cov (z{{ns), Zl(nf)^j = X (P (N k (ns) ^ 0, N k (nt) ^ 0) - p k (ns)p k (nt)) 

k> 1 

= J2 ((1 - e ~ PknS ) - (! - e~ Pkns ){l - e~ Pknt )) = V(n(s + t)) - Vint), 

k> 1 

whence by 

lim \ Cov (ZUns ), Z{int) \ = T(1 — a) ((s + t) a — t a ). 

n—>oo (j" \ / 

For the odd-occupancy process, using the independence and the stationarity of the incre¬ 
ments of the Poisson processes, for 0 < s < t, 

Cov (ui(ns), Ul(nt)^J = E (F{N k {ns) is odd, N k (nt) is odd) — q k {ns)q k {nt )) 

k> 1 

= X (qk{ns){l - q k (n(t - s))) - q k {ns)q k (nt )) 

k> 1 

= i - e- 2pkns ){e- 2pkn ^- s) + e~ 2pknt ) = ^ (U(2 n(t + s)) + V(2n(t - s ))). 

^ fc>i 

Thus, again by ffT2jh 

lim Cov (u^ins), Uf(nt) \ = T(1 — a)2"~ 2 ((f + s) Q — (t — s) a ). 

n—>■ oo \ / 
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(ii) Finite-dimensional convergence. The finite-dimensional convergence for both processes 
is a consequence of the Lindeberg central limit theorem, using the Cramer-Wold device. 
Indeed, for any choice of constants a\,... , ad G M, d > 1, and any reals ti,...,td G [0,1], 
the independent random variables ^f =1 ai(l{jv fc (nt;)^o} — Pfc(ntj)), k > 1, n > 1 are 
uniformly bounded. This entails the finite-dimensional convergence for (Zf(nt)/a n )t^[ o,i]- 
The proof for (Uf(nt)/a n ) t e[o, l] is similar. 

(m) Tightness. The proof of the tightness is technical and delayed to Section lT3l □ 

Proposition 2. For any Rademacher sequence £ = (£ k )k> i, 

Z 2 (nt)\ (Uo(nt)\ . ... 

~ I (^2(^))t£[0,l] an d I ~ I ^ (U2(^))tg[ 0)1 ] , 

C'n / \ C'n / 

/ te[ 0 , 1 ] \ / tG[ 0 ,l] 

in £)([0,1]), where Z 2 is as in Theorem{T\ and U 2 is as in Theorem [H 

Proof. First remark that, since for all t > 0, quit) = |pfc(2£), we have I7f(t) = |Zf(2t). 
Thus the second convergence follows from the first one. 

(i) The covariances. Since the e k are independent, using (fT2]h we have for all t, s > 0 

4 Cov(Z|(nt), Zf M) = 4 E H4)Pk(nt)p k (ns) = 4 E^ “ e^X 1 ~ e _PfcM ) 

= — (P(nt) + V(ns) — P(n(t + s))) —> T (1 — a) (t a + s a — (t + s)“) as n —)■ 00 . 
ot 


(ii) Finite-dimensional convergence. Since Zf is a sum of independent bounded random 
variables, the finite-dimensional convergence follows from the Cramer-Wold device and the 
Lindeberg central limit theorem. 


(in) Tightness. Let p be a positive integer. By Burkholder inequality, there exists a 
constant C p > 0 such that for all 0 < s < t < 1 , 

p 


E 


(Tri 


( Z e 2 (nt) - Z e 2 (ns )) 


2 p 


< 


Cp E (Pk(nt) - p k (ns)f 


(J n 

< C 1 

— 2 P 

On 


K k> 1 


EftW *-®)) 2 =c p 


K k> 1 


V(n(t — s)) 


ot 


We now use Lemma [TJ Let 7 G (0, a). There exists C~ > 0 such that 


E 


— (Zf(rat) - Zf(ras)) 


<7 


2 p 


< C p Cf.\t — s| 7P uniformly in \t — s| G [0,1]. 


Choosing p such that yp > 1, this bound gives the tightness { 2 , Theorem 13.5]. 


□ 











14 


OLIVIER DURIEU AND YIZAO WANG 


3.3. Tightness for Z\ and U{. Recall that e £ {—1,1} N is fixed. Let G be either Z\ or 
U[. To show the tightness, we will prove 


(13) limlimsupP sup \G{nt) — G(ns)| > rja n = 0 for all p > 0. 

<5_ ^° n—\ |t-s|<5 J 

The tightness then follows from the Corollary of Theorem 13.4 in j2|. To prove (fT3lh we 
first show the following two lemmas. 

Lemma 2. Let G be either Zf or Uf. For all integer p > 1 and 7 £ (0, a), there exits a 
constant C P:1 > 0 such that for all s,t £ [0,1], for all n > 1, 

(14) E|G(ns) - G(nt)\ 2p < C Pi7 (|t - s\^ p a 2p + |t - s^a 2 ) . 

Lemma 3. Let G be either Z\ or Uf. For all t < s < t + S, 

(15) | G(t) — G(s)| < N(t + 5) — N(t ) + <5, almost surely, 

where N is the Poisson process in the definition of Zf and U[. 

A chaining argument is then applied to establish the tightness by proving the following. 

Lemma 4. If a process G satisfies CH> and (fl5l) for a Poisson process N, then (1T31) holds. 

Proof of Lemma\^ We prove for G = Uf. The case G — Z\ can be treated in a similar 
way and is omitted. In view of Lemma [T| it is sufficient to prove that for all p > 1 and all 
0 < s < t < 1, 

(16) E|G(t) - G(s) | 2p < C p (V (2 (t - s)) p + V(2(t - s))), 

with the monotone increasing function V defined in f[T2|l . We prove it by induction. For 
p — 1, by independence of the TV*,, we have 

E|G(t) G(s) | ''f ^ Var (l {Nk(t) is odd} ^{^(s) is odd}) 

k> 1 

A 'y ] E (l{IVfc(t) is odd} — l{N fe (s) is odd}) A y ) Qkif ~ s ) — ^ _ S ))' 

fc>l k >1 

Let p > 1 and assume that the property holds for p — 1. We fix 0 < s < t, and simplify 
the notations by setting 

A k ■ l{IVj.(i) is odd} Qkif) (l{Ay(.s) is odd} 9fe(^)) ■ 
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Note that \X k \ < 2 for all k > 1. Since (X fc ) fc >i is centered and independent, it follows 
that 


E|G(«)-G(s)| 2p = E E { x l--- x l 

k\ ^ • 5 kp ^ 1 

< E E ( x l ) E ( X L •" n 

k\ •)•••! kp ^ 1 


E E ( K--xl 


k i,..., Aip ^ 1 
AiiG{Ai 2 ,...,Aip} 


< fE E «)+ 4 (p -!)) E E ( A t" ■ x l 

\All>l / A 2 ,..,fcp>l 

By the induction hypothesis, we infer 

E|G(t) - G(s)| 2p < Pv(2 (i - a)) + 4(p - id C_! (V(2(t - a))'- 1 + V(2(t - a)) , 

and we deduce (TT 6 T) using the monotonicity of V. 

Proof of Lemma 0 Let t < s < t + 8. Recalling (TTU]) . we have 

l^i(s) - Zf(t)\ < Y |l{N fc ( s )^o} - l{jv*(t)*o}| + Y \Pk( s ) ~ Pk(t)\ 

k> 1 fc>l 

< Y l{JV*( S )-JV*(t)^ 0 } + J>(- ~ ^ 

k> 1 k>\ 

< N(s ) - N(t) + E (N(s - t)) < N(t + 5)- N(t) + 8. 

Similarly, recalling (TIT]) . 

\Ui (s) — U x (t) | < ^ ] 11 (JV*.(s) is odd} is odd} | + ^ ^ I Qk( s ) ~ Qk{t) I 


□ 


k> 1 


k> 1 


- ^ l{iv fc (s)-iv fc (t)^o} + - t) < N(t + 5) - N(t) + 8. 


k> 1 


fc>l 


□ 


Proof of Lemma\^ Let 77 > 0 be hxed. For 5 G (0,1) and r := + 1, we set R := for 

z = 0,..., r — 1, and t r := 1. By [3, Theorem 7.4], we have 

(17) P ( sup | G(nt) — Cr(ns)| > 9r)cr n ] < V^P ( sup |G(ns) — G(nti_ i)| > 3rja^\ . 

\^|t—s | <5 J ^ = | \ti-i<s<ti J 

The sequel of the proof is based on a chaining argument. Fix z e {1,..., r}. For all k > 1, 
we introduce the subdivision of rank k of the interval [L-i, t,;] : 


x k ,£ ■= ti -1 + £-£, for k > 1 and £ = 0 ,..., 2 k . 
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For s G and n > 1, we define the chain s 0 := < Si < ... < s kn < s, where 

for each k, s k is the largest point among (xk/)e=o ... 2 k °f rank k that is smaller than s, and 
where we choose 


(18) 


k n . 


log 2 (l{e - 1) — \ 
\ V*nJ 


+ 1 . 


This choice of k n will become clearer later. For tj_i < s < ti, we write 


(19) \G(ns) - G(nti-i)\ < ^ G(ns k ) - G(ns k - 1 )| + |G(ns) - G(ns kn )\. 

k =1 

Since we necessarily have s k = s k -\ or s k = s k -1 + we infer that for all k > 1, 

(20) \G(ns k ) - G(ns fe _i)| < max \G{nx k t ) - G(nx k *_i)|. 

e=i,...,2 k 


Now, by Lemme [31 we get 

\G(ns) - G{ns kn )\ < N(n(s kn + 82 ~ kn )) - N(ns kn ) + n52~ kn 

(21) < max (N(n(x kn/ + 82~ kn )) - N(nx kn/ ))+n82~ kn . 

£=0,...,2 k n-l 

Observe that our choice of k n in (fT8l) gives n52~ kn < r]a n . By (TT9|) . (1201) and (J2TJ), we infer 


limsupP ( sup |G(ns) — G(nij_i)| > 3 rja r 


( 22 ) 

(23) 


< lim 


imsupP ) max I G(nx k A — G{nx k t- 1 )| > i)o n 

n—>oo \ “ e=l 2 k 


,k =1 


+ lim sup P 


max (N{n(x k i + 52 kn )) - N{nx kn M > rja r 

e=o,...,2 k n~i ' 


For Q2SD , using exponential Markov inequality and the fact that E(e Ar ^- ) ) = e x ^ e we infer 


P 


max [N(n(x k i + 52 kn )) - N(nx kn M > r]a r 

£=!,.. „2 k 


< 2 kn F (N(n52~ kn ) > rja n ) < 2 kn e nS2 fcn (e-i)-wn. 


Again by the choice of k n in (1T8li . 2 kn < 4(e — l)n5/(rjcr n ) and 2 kn < rja n /(2(e — 1 )n5). 
Thus, the above inequality is bounded by 4(e — l)n5/(r]a n )e~ 2 rir7n ^ which converges to 0 
as n —> oo. So, the term ()23l) vanishes and it remains to deal with (l22l) . Let rj k := k ^ +l) , 
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k > 1, so that J2 fc>1 r] k = p. We have 



k n 2 k 

sEE P (| G(nx k/ ) - G(nx M _i)| > r] k a n ). 

fc=i u=i 


Now, fix 7 e (0, a) and let p > 1 be an integer such that 779 > 1 . Using Markov inequality 
at order 2 p and the 2 p-th moment bound (fT-flh we get 


k n 2* 


P| ^ max fc \G(nx k /) — G(nx k ,e- i)| > poy J < EE % 

fc =1 £ 


2p E |G(nx M ) - G(nx M i)| 


2 p 


k n 2 k 


< C prf ^ r lk 2p ( \ x k,e ~ x k,e~\ | 7P + 

k =1 1= 1 


fc=l £=1 

^k,£— 11 

w n 


cr, 


2 p 


< p- 2p 2 fc(1 “ 7p) + C' Pi7 5 7 n Q(1 - p) L(n) 1 - p ^77“ 2p 2 fc ( 1 - 7) . 


fc=i 


fc=i 


In the right-hand side, since yp > 1, the series in the first term is converging and is 
independent of n. The sum in the second term is bounded, up to a multiplicative constant, 
by 2 kn< ' 1 ^ which is of order 77,W“/ 2 K 1- 7') (l iere anc [ next line, up to a slowly varying 
function). Thus, the second term in the right-hand side is of order n 1 -“P+«/2-7+7«/ 2 < 
n,i— 7 P+ ( a— 7 )(i—p) an d vanishes as n goes to 00 , again because we have assumed yp > 1. So 
for (l22]h we arrive at 


lim sup P 

n—}-oo 


kn \ 

max \G(nx k ,e) ~ G(nx k ,e-i)\ > pcr„ 

U «= i <-> 2fc J 


< C5 1P 


for some constant C independent of <5 and p. From (fT71h we conclude that 


lim sup P 

n—>■ oo 


sup 

|t—s|<5 


| G(nt) — G(ns)| > 9poy 




57 p 


which goes to 0 as 5 j. 0. This yields (TT51) . 


□ 


Remark 5. For the Poissonized model, we can establish similar weak convergence to the 
decompositions as in Theroems [1] and [21 by adapting the proofs at the end of Section [U 
We omit this part. 
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4. De-Poissonization 


In this section we prove our main theorems. 

Recall the decompositions 

Z £ = Z{ + Z% 

and 

u £ = U{ + C/|, 

and 



Z £ = Z\ + Zf 

and 

u £ = fjl + f/|. 


Note that G £ and G £ , for G being Zi, Z 2 , U\, f/ 2 respectively, are coupled in the sense that 
they are defined on the same probability space as functionals of the same £ and (Y n ) n > 1 . We 
have already established weak convergence results for Zf, Zf, f/|. The de-Poissonization 
step thus consists of controlling the distance between G £ and G £ . We first prove the easier 
part. 


4.1. The processes Zf and U 2- 

Theorem 4. For a Rademacher sequence e, 
'Zl([nt\)\ 

/ *€[ 0 , 1 ] 


G r\ 


(Mt)) 


(€[ 0 , 1 ] 


and 


U! 


(M) \ 
cr„. J, 


mt)) 


te[o,i] 


iG[0,l] 


in .D([0,1]), where Z 2 and U 2 are as in Theorems [7] and[H 

Proof. Thanks to the coupling, it suffices to show for all £ G {—1,1} N fixed, 

\G £ ([nt\) — G £ (nt)\ 


lim sup 

n— *°° £€= [ 0 , 1 ] 


= 0 


(Jr) 


in probability, with G being Z 2 , f/ 2 respectively. We actually prove the above convergence 
in the almost sure sense. Observe that for all £ G {—1,1} N , 


I - Z £ (nt) | < I Pk(nt) - p k ([nt\)\, 

k> 1 

I U^{\nt\) - U£(nt)\ < Y I Qk(nt) - q k ([nt\)\. 


k> 1 


Thus, the proof is completed once the following Lemma [5] is proved. 
Lemma 5. The following limits hold: 


□ 
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Proof. By triangular inequality, for all n > 1, t > 0, 

Y I Pkint) -Pk([nt\) | < Y \Ml nt \) - hint) | + ^ |p fc (M) ~ Pk{[nt\) |. 

fc>i fc>i fc>i 


First, note that for all k > 1, 

|p*(lMl) -Pfe(^)| < + 1) -Pfc(L nt J) = e _Pfc|ntJ (l - e _Pi: ), 


and thus, 

Y^ k ^ nt ^ -Pfe( nt )l < = 1 ‘ 

fc>i fc>i 

Further, if [ntj > 1, using that e~ my — (1 — y) m < T(i — e~ my ) for all 0 < y < 1 and 
m 6 N, we have 


Y IPfc(lMl) -Pfc(M)l = 5^ (e PfeNJ - (1 -p fc ) L ” tJ ) 

fc>i fc>i 


< 



_ e -Pfc l nt l 


rqntj) 

[nij 


which is bounded (since V{n)/n —> 0 as n —> oo). We thus deduce ()24lh The proof for ()25l) 
is similar and omitted. □ 


4.2. The processes Zf and U{. In this section we prove Theorem [31 The coupling of 
Z \, Z\ and F/f, f/f respectively takes a little more effort to control. 

Proof of Theorem 0 Let N be the Poisson process introduced in Section [3] and denote by 
t j the z-th arrival time of N, i > 1, namely r,- := inf{t > 0 | N(t) — i}. We introduce the 
random changes of time A n : [0, oo) —> [0, oo), n > 1, given by 

A n (t) := M, i > 0 . 

n 

By constructions, we have 

Z £ ([nt\) = Z e (n\ n (t )) and U e ([nt\) = U £ (nX n (t )), amost surely. 

These identities do not hold for the process Z\ or U[ but we can still couple Zf, Z\ and 
Uf, Uf via 

(26) ^i(lMl) = Z{(nX n (t )) + Y £ k(Pk{nX n (t)) - Pk([nt\)) 

k> 1 

(27) Ul(\nt\) = Ul(nX n (f)) + Y, £ k(Qki.nX n (t)) - q k ([nt\)). 

k> 1 

The proof is now decomposed into two lemmas treating separately the two terms in the 
right-hand side of the preceding identities. 
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Lemma 6. We have 

Zl(nK(t)) \ ^ ( Zl ( t )), 6(01| and 

Jn / 

/ te[ 0 , 1 ] 

in 1A([0,1]). 


f Uf(nX n (t)) \ 

V Gn ) te[0,i] 


(Ui(t)) te [ 0 ,i] 


Proof. We only prove the first convergence. The proof of the second is the same by replacing 
(Zf,Zi) by (£/f,Ui). For t > 0, by the law of large numbers, X n (t) —> t almost surely as 
n —> oo. Since the X n are nondecreasing, almost surely the convergence holds for all t > 0, 
and by Polya’s extension of Dini’s theorem (see |23|, Problem 127]) the convergence is 
uniform for t in a compact interval. That is 

lim sup |A n (t) —t | =0 almost surely, 

and X n converges almost surely to the identity function I in ZA ([0,1]). 

We want to apply the random change of time lemma from Billingsley [2j, p. 151]. However, 
X n is not a good candidate as it is not bounded between [0,1]. Instead, we introduce 

A *(t) := min (A n (t), 1 ), t > 0 . 

Observe that by monotonicity, 


sup |A* (t) — t\ < sup \X n (t) — t\. 
te[o,i] te[o,i] 

Thus, A* converges almost surely to I in H([0,1]). By Slutsky’s lemma and Proposition [TJ 
we also have 


(28) 


Zf(nt) 


(Jn 


> (^n(^))te[0,l] => ((Zi(t)) tG[0 ,l],l) 


te[o,i] 


in D([ 0 , 1 ]) x ZA([0,1]). Furthermore, since A* is non-decreasing and bounded in [0,1], thus 
by random change of time lemma we obtain 


(29) 


Zi( n K(t)) 


(J n 




te[o,i] 


in ZA( [0,1]) - To obtain the desired result we need to replace A* by X n . However, by 
dehnition, we only have, for all r) G (0,1) fixed, 

7 ^ K on [0,1 - T]}) < P > n) -A- 0 as n oo. 

It then follows that, restricting the convergence of (|29|) in ZA([0,1 — 77 ]), 


Zf(nX n (t)) 


(Jn 




tG [0,1—77] 


in ZA([0,1 — rj\). This is strictly weaker than the convergence in ZA([0,1]) that we are 
looking for. However, looking back we see an easy fix as follows. If one starts in (1281) with 
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weak convergence for Z\ and A* (modified accordingly) as processes indexed by a slightly 
larger time interval, say in £)([0,1/(1 — 77 )]) for any 77 G (0,1) fixed, the desired result then 
follows. □ 

In view of Lemma 0 the following lemma will be sufficient to conclude. 

Lemma 7. The following limits hold: 

lim — sup | p k (nt) — pk{nX n (t))\ = 0 in probability 
rwoo a n te[ o,i]“ 

and 

(30) lim — sup | q k {nt) — q k (nX n (t))\ = 0 in probability. 

n^oo a n te[ 0 ,i] g 

Proof. We only prove the second limit. The first one can be proved in a similar way and 
is omitted. We first introduce 

A n (t) ■= n*(\ n (t) - t) = n~^(Ty nt j - nt). 

Since r n is the sum of i.i.d. random variables with exponential distribution of rate 1, and 
since n~^(nt — \_nt.\ ) converges to 0 uniformly in t, by Donsker’s theorem and Slutsky’s 
lemma, we have 

(A ra (f)) te [ 01 ] =>- (B(f)) te j 0 ^ in £>([ 0 , 1 ]), 

where B is a standard Brownian motion. By the continuous mapping theorem, the se¬ 
quence sup t 6 r 0;1 i |A n (i)| weakly converges to sup te r 01 i |B(f)|, as n —> 00 . In particular, 
(sup te [ 0)1 j |A n (t)|) n >i is tight. So, for any 77 > 0, there exits K v > 0 such that for n large 
enough, 

(31) P ( sup |A n (t)| > KA < 77 . 

\*£[ 0 , 1 ] / 

Now, choose f3 G (0,1/2) and consider 

A n \= sup ^2\qk(nt) - q k (n\ n (t))\ and B n := sup ^ | q k (nt) - q k (nX n (t))\. 

t&[o,n-f>] te[n~P, 1 ] fc >j 

Concerning A n , using the bound in (fill) , we have 

A n < sup q k (n\ X n (t) -tj) = sup ^ q k (n\X n (n~ p t) - rT p tj). 

te[0,n-/3] fc > 1 7£[0,1] 

We can write 

X n (n~ p t) - n~ p t = Anl J^ . 

n 2 

For any 77 > 0 , using (J31]) , by monotonicity of q k (-), we infer that for n large enough 
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But 


- («‘- (1+ « /2 if,) = K (2n^/i Krl ) 


k> 1 


2 cr, 

r(l - a)2 a ~ 1 K^n~^ 


-> 0 as n ^ 00 . 

Thus, A n /a n converges to 0 in probability as n goes to oo. 

Concerning B n , using the identity (TIT]) , we can write 

B n = sup (1 - 2 q k (n min(A n (t), t))) q k (n\X n (t ) - t|) 

= Slip ^ e - 2 P^ mi n(A„(t),I)^( n | An ( t ) _ t |)_ 
iGln-^,1] fc > x 

Now, for t G [n”' 3 ,1], observe that if for some K > 0, |A„(t)| < K and nh~^ > 2K , then 

t 


A n (t) =t + n 1/2 Kit) > t — n 1/2 |A n (f)| > t 


n 


> -, 
2 ~ 2 


and thus min(A n (t),f) > Let r/ > 0 and A/, be as in (l3Tlh Assume n is large enough so 
that (13B holds and n^~ /3 > 2K V (which is possible since we have chosen (3 G (0,1/2)). By 
the preceding observation and by monotonicity of %{■), we infer 


P [ B n < sup V i 

V te[n-P, 1 ] 




q k \n-n *K n ) > 1 -rj. 


Now, using 1 — e x < x and then xe x < 1 — e x for x > 0, we get 


sup 'Ve- pknt q k (n^K) = Ve~ Pfcn1 ^- (1 - e“ 2pfen ^ 
te[n-*,i] tzi V ' 


k> 1 


< 


J2e~ Pknl P p k n^K v < ( x - e ~ Pknl n-^K = n fl - 1/2 V(n 1 -P)K ri . 


k> 1 


k> 1 


Thus, 


— sup Y,e-^%(n'-iK v )<AA 


(Jr 


V 


~ T(1 — a)K v n^ 2 A 1 f .J —» 0 as n —> oo, 

L(n) 1 / 2 

since (3 G (0,1/2). Thus B n /a n converges to 0 in probability as n goes to oo. We have 
thus proved f|3Uj) . □ 


To sum up, the desired results now follow from (I25|) and (12711 . Lemmas El El and [3 and 
Slutsky’s lemma. □ 
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4.3. The trivariate processes. Finally we conclude by establishing the main theorems. 


Proof of Theorems^ and\M We prove Theorem [TJ The proof for Theorem [2] is the same. 
We denote by £ the cr-£eld generated by the (£k)k >i which is then independent of (U n ) n >i • 
Note that the process Z\ is ^-measurable. For any continuous and bounded function / 
and g from Z1([0,1]) to M, we have 


ei/i MM2V Z «LM) 


(J 77 


(7 77 


= E 
< E 


E (/ 


Zf([n-\) 

@ n. 


£)g 


- E/(Z 1 )Ep(Z 2 


£) -E/(Z0 


E/(Z 1 )E^(Z 2 

sf(ln-J) 


% 


(J77 


— E^(Z 2 ) 


The hrst term goes to 0 as n —> oo thanks to Theorem [3] and the dominated convergence 
theorem. The second one goes to 0 as n —» oo thanks to Theorem [4j By [28l . Corollary 
1.4.5] we deduce that 


— (^(N). 2 2(M)) le[ o,i] 

Vn 


(Zi(t), z 2 (t)) tg [ 0il ], 


in £)([0, l]) 2 where 7L\ and Z 2 are independent. The rest of the theorem follows from the 
identity Z s = Z{ + Z\. □ 
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